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Circuit Ketwork Analysis Using Adaptive Algebraic Multigrid 

Approach 

[0001] This application relates to analysis of circuit 
5 networks such as power networks and clock networks- This 

application includes two Attachments I and II on pages 4-13 as 
part of this application, 

[0002] The network analysis methods described here may be 
generally categorized as algebraic multigrid (AMG) methods. An 

10 AGM method can be used to handle circuit networks with irregular 
structures. The present network analysis methods significantly 
improve the capabilities of the conventional AGM methods in 
various aspects. For example, adaptive grid structures and 
adaptive smoothing operations are used to capture the multi-rate 

15 behavior and circuit latency in various networks. The circuit 
spatial and temporal latency are fully explored along with the 
mutli-rate behavior to avoid any unnecessary computation without 
losing accuracy and convergence of the analysis. The present 
methods also have the ability to analyze coupling effects with 

20 electromagnetic retardations for high-frequency chi-package- 
board scale analysis. Hence, as one example, the effects of 
inductance in the circuits can be included in the analysis. 
[0003] More detailed description is included in Attachments I 
and II, where exemplary implementations in Attachment I on pages 

25 4-7 are referred to as ''Adaptive AMG," and exemplary 
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implementations in Attachment II on pages 8-13 are referred to 

as "SPICEDiego-" Several exemplary implementations of the network 

analysis methods described here are summarized as follows. 



5 1. A method for analyzing a circuit network, comprising: 

representing a circuit network by using a matrix of nodes 
having fine nodes and coarse nodes; 

applying an adaptive coarse grid construction procedure to 
assign grid nodes in the matrix as either coarse grid nodes or 
10 fine grid nodes according to (1) circuit activities and (2) a 
matrix structure of the matrix, where a node with a maximum 
potential is assigned as a coarse node and an unassigned node 
adjacent to a coarse node is assigned as a fine node; and 

applying iterative smoothing operations at selected local 
15 fine grids corresponding to active regions at a finest level 
obtained in the adaptive coarse grid construction procedure. 

2. The method as in 1, wherein coarse grid nodes are 
divided into non-adaptive coarse nodes which are selected 

20 according to the matrix and adaptive coarse nodes which are 
selected according to circuit activities. 

3. The method as in 2, wherein, in assigning non-adaptive 
coarse nodes, a potential of each neighboring node of a newly 
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assigned fine node is increased by one unit before a next level 

of assigning coarse and fine grid nodes so that each fine node 

has at least one neighboring coarse node. 

4. The method as in 2, wherein an adaptive coarse node I 
selected according to a first-order derivative of a nodal 
voltage. 
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AnACHMEHT I 

Power Network Analysis Using an Adaptive Algebraic 

Multigrid Approach 



ABSTRACT 

In this paper, we introduce an efficient analysis method for the 
power network of general topology. The new approach is based 
on algebraic multigrid (AMG) method that can avoid the slow 
convergence of basic iterative methods. An innovative adaptive 
coarsening and error-smoothing scheme is employed to further 
speed up the performance, taking advantage of the spatial 
variation of power supply noise. Experimental results show that 
our method is more than 100 times faster than SPICE3. 

Categories and Subject Descriptors 

[Design Aids]: Simulation 

General Terms 

Algorithms, Performance, Experimentation 

Keywords 

Algebraic multigrid method, Adaptive analysis, Power 
distribution network. Circuit Simulation 

1. INTRODUCTION 

Power network analysis covers voltage drop, voltage oscillation, 
and electromigration [1.2]. Excessive voltage drops reduce the 
switching speed as well as the noise margins of circuits and cause 
logic failures. Electromigration can decrease the chip lifetime [3]. 
Previous on-chip power grid analysis focuses on IR-drop caused 
by the resistance of power network. With the signal frequency 
increasing rapidly, the jaxLof on-chip power network becomes 
comparable with the resistance. Hence the inductance effect 
cannot be ignored. Moreover voltage oscillation may occur when 
power netwoik resonance frequency drops to the range of the 
signal frequency [5]. 

One bottleneck of power net^-ork analysis is the tremendous 
amount of elements in power network. Direct methods such as LU 
decomposition used in SPICE are thus prohibitive. To address this 
bottleneck, M. Zhao and Y. Cao used hierarchical macro 
modeling or model order reduction techniques in [6][7J; H. Chen 
divided the chip into 100^100 cquipotential segments to reduce 
the complexity in [8]; Y. Lee adopted the ADl concepts in [9]. 



Some sparse matrix techniques have also been applied to the 
power network simulation. T. Chen employed a Prccondidoncd 
Conjugate Graidient (PCG) solver in [10.18] and Nassif developed 
a multigrid method in [I l][12]. 

For typical power grid, the RLC values arc quite uniform at the 
same layer. But due to the non-uniform power densities and the 
timing of the switching events, power supply noise exhibits 
spatial variation, which means some nodes of the power network 
have more rapid nodal voltage changes, or in other words, these 
itodcs arc more active than the rest circuit 

In this paper, we present an adaprive approach for power grid 
analysis based on algebraic multigrid (AMG) method [IS]. There 
are two kinds of multigrid methods: geometric multigrid and 
algebraic multigrid. Geometric multigrid requires regular mesh 
structure but has less coarse grid reduction cost while algebraic 
multigrid (AMG) can handle problems in general topology but 
requires heavier coarsening ovofaead [15]. 

Adaptive concepts can be integrated with the coarsening scheme 
and smoothing operation in multigrid method. Some adaptive 
methods like FAC [19] and MLAT [20] can be applied to 
geometric multigrid method. The basic idea behind these adaptive 
methods is that active regions should have finer grid strucnjre, 
since active subnetworks need more computation to model their 
behavior accurately. 

The main differences between our approach and the multigrid 
method in [ 11 ][ 1 2] are 

1) Our approach covers self and mutual inductances, while 
[11][I2] can only handle RC networks. This is due to the 
limitation of geometric muldgrid coarsening algorithm used in 
[ll][12j. Since multigrid method requires the system matrix 
to be synunetric positive definite (S.P.D.) which circuits with 
induaors cannot satisfy using Modified Nodal Analysis 
method (MNA), reformulation as discussed in section 2 is 
needed. But the topology of reformulated system matrix is no 
longer die same as original circuit topology. It is not easy to 
derive coarse level grid directly from circuit topology in this 
kind of situation even if the power network is a regular mesh. 

2) Our coarsening scheme makes no assumption about the 
topology of power network. While in [ll](121, special 
consideration is needed for irregular power networks. 

3) [11][I2] ignored the error smoothing operation and only 
executed one multigrid iteration cycle. Although it may not 
result in much error for well -designed power network, 
accuracy caimot always be guaranteed for general cases. We 
keep the smoothing operations and iterate the multigrid cycles 
until the norm of the residue is less than a threshold. 
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4) We adopt an adaptive coarsening and smoothing scheme, 
which makes it possible to assign more computation to the 
active subnetwoi^s and reduce the unnecessary computation 
for inactive subnetworks. 

2. OVERVIEW 

Wc focus on power source nct>vork with the assumption that 
power and ground can be separated. Usually power network is an 
irregular mesh as shown in Figure 2.1. Every intersection node 
has a ground capacitance. Between neighboring intersection nodes 
is resistor or resistor and irKhictor in scries. Mutual inductance can 
also be included. The devices with activities are modeled as time- 
vaiyingcujTcnl sources [8]. 

When only RC model is included, system equation can be 
formulated as 

CX{i)^OX{t) = U{t)* (2-0 
where X is the vector of nodal voltages. 

Appling trapezoidal approximation with time step h to equation 
(2.1), we have 

{C + ^QXO + = -(G - ^C)X{t) + U{t) 4- U{t + h) ^2.2) 
a h 
The left hand side matrix in (2.2) is symmetric and positive 
defifiite which makes the iterative methods converge quickly. 
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Figure 2.1 Power Network Structure 



When inductance 
becomes: 



is included, the system matrix equation 

CX{t)^CX{f) = 0{i) (2.3) 



where , 



Equation (2.3) can be rewritten as: 



[: -.IK] 



(2.4) 



Appling trapezoidal approximation with time step h, die soludon 
of equation (2.4) is derived by 

Although MNA mediod can handle elements without admittance 
description, the transient analysis system matrix in (2.S) is no 
longer symmetric and positive definite when the inductance is 
included because of the introduction of current variables. 

Since Mulligrid as well as PCG method requires the matrix to be 
symmetric positive definite, some extra processing is needed to 
reformulate the system [16). Similar lo the method used in [10], 
wc split the variable vector into nodal voltage vector and branch 
cuireni vector Using block matrix operation!;, we decompose 
(2.5) into two iteration formulas (2.6) for nodal voltages and 
branch cuiients, respectively. 



h 



(2.6) 



[Y-C + |Y^-'.<,)K(i)+[t/(/)*t/(i*A)UW//(n 
- l{f + 6) = /(O - ^ L~ ^AfVif + A) ♦ K{/)1 

In equation (2.6), corresponds to the K matrix [17], where die 
matrix inversion overhead is reduced by sparsification methods. 

If the inductance matrix is symmetric and positive definite 
(S.P.D.), wc can prove that the system matrix is still S.P.D. This 
also holds for forward Euler, backward Euler integration 
approximation methods. Note d»at the topology of reformulated 
system nialrix in (2.6) is no longer die same as original circuit 
topology. Geometric based coarse grid redaction algorithm 
cannot be applied to RLC network directly. 

3. MULTIGRID METHOD 

The nniltigrid method is an efficient technique first widely used 
for solving partial differential equations [IS]. The basic idea of 
multigrid method is lo map the hard-to-damp low frequency error 
at fine level to casy-to-damp high frequency error at coarse level, 
solve the mapped problem at coarse level, and then map the error 
correction of coarse level back to fine level. The mapping 
operator from fine to coarse level is called restriction operator /» . 

and the mapping operator from coarse to fine level is called 
interpolation operator . Here the subscription represents the 

level. At each level, high frequency cnor is erased by a smoothing 
operator that is a forward iterative method such as Gauss-Seidel. 
The construction of hierarchical grid structure stops at the level 
when the reduced matrix can be quickly solved by LU 
decomposition. Once we find the exact solution of the problem at 
the coarsest level, wc perform the interpolation from the coarsest 
level back to the find grid. An iteration of restriction from the fine 
grid to the coarsest grid and interpolation from the coarsest grid 
back to the fine grid is called a single multigrid cycle, which will 
iterate several times till converge. 

3*1 Geometric Multigrid vs. Algebraic 
Multigrid 

There ore two kinds of multigrid methods: geometric multigrid 
and algebraic multigrid (AMO). Geometric multigrid method 
requires regular mesh structure. Its coarse grid reduction 
algorithm is defined geometrically, eg. coarse node is selected 
every other fine node. 

If we want to handle problems with irregular structure, AMG is a 
good alternative of geometric multigrid method. The coarsening 
and interpolation operation of AMG are based on matrix itself. 
This overhead makes AMG less cfTicient than geometric multigrid 
if the problem analyzed has regular mesh structure. 

In geometric multigrid, the smoothing operation at each level is 
important to the convergence. The error after smoothing should be 
geometrically smooth, in other words, the local error among 
adjacent nodes should be small. While in AMG, the interpolation 
plays the crucial role. The convergence rate actually depends on 
the coarse grid construction and inter-level mapping operators 
[14]. 
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3.2 AMG Interpoladon & Restriction 
Operator 

Stocc AMG has no grid concept in mind, the inter-level mapping 
operators have co be detennined based on the matrix. In AMG. 
smooth error means error components with relatively small 
residuals (3.1) [IS], which means after several iterations of 
smoothing operation, the residue is small but the error decreases 
very slowly. 

Ae^O (3.1) 
Equation (3.1) can be rewritten as (3.2). Hence the error of fine 
node can be well represented by linear combinations of its 
neighbors' error. 

If coarse and fine nodes have already been defined, we can further 
approximate the error of fine node by errors of only its coarse 
node neighbors. Although then; arv many other complicated 
interpolation methods, we find that this simple approach works 
well for the power grid problem. From (3.2), wc can construct the 
interpolation operator /^^ . Because of the symmetry of the 

mapping between levels in two directions, the restriction operator 
/2*is the transposition of the interpolation operator, i.e./Jj^ = z^** . 
The coarse level matrix can be computed as y|» = i^^A^i^ - 

Consequently, coarse level matrix A^* remains synunetric 
posidve definite. 

If the matrix is symmetric and positive definite, convergence of 
AMG can be guaranteed as long as the smoothing operation 
converge at eadi level. A detailed proof can be found in [13]. 

4. Adaptive ALGEBRAIC MULTIGRID 

In this section, we discuss the adaptive coarse grid constmction 
procedure and the adaptive smoothing operation. 

4.1 Adaptive Coarse Grid 

In our adaptive coarsening scheme, active regions have relatively 
finer grid at coarse level. The coarse grid nodes are determined 
adaptively by the circuit activities as well as by the matrix 
structure. The coarse grid consists of two kinds of nodes: non- 
adaptive and adaptive coarse nodes. Non-adaptive coarse nodes 
are selected by coloring scheme [15] according to the matrix. 
Adaptive coarse nodes arc determined by the circuit activities. 

4, L J Coloring Scheme for Non-Adaptive Coarse 
Nodes 

A iwo-level coloring scheme is explained below for sake of 
clarity. In practice, the coloring scheme is executed at every level 
except the coarsest one. 

Initially, the potential of each node is set to its degree. The node 
with the maximum potential is selected to be a coarse node and all 
its unassigned neighboring nodes are set as fine nodes. For each 
newly set fine node, we increase the potential of its neighbor by 1. 
The process is repeated until every node has been assigned as fine 
or coarse node. At the end, each fine node has at least one 
neighboring coarse node. 

4. 1.2 Adaptive Coarse Nodes Selection 



Adaptive coarse nodes are selected according to the activities of 
circuits. One good candidate for the measurement of the impact of 
circuit activities is the first order derivative of nodal voltages diat 
can be approximated fzam equation (2.4) for RLC networtc. 

For RLC networks as shown in Fig 2. 1 , since not every node has a 
ground capacitor, we split voltage vector V into two separate 
voltage vectors Vi and V2, where V] is the set of nodes with 
ground capacitor and V2 is die set of nodes between resistance 
and inductance in branches. We then calculate the first order 
derivative of nodal voltage in Vi. 

Rewriting equation (2.4), we have 

(4.1) 
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The first order derivative of intersection node voltage can be 
approximated as 

V,it + A) = -<r\G,,V,{l) ♦ C„K,{f) - A^Ai) * U(f ♦ A)l ^^^^ 

The inversion of capacitance matrix is easy because the 
capacitance matrix is aOually a diagonal matrbc. 

Those nodes with relatively larger voltage derivative are selected 
as adaptive coarse nodes. We apply adaptive coarsening at the 
finest level. The coarse grid selection at odier levels is determined 
solely by die coloring scheme, because active components with 
finer firsi level coarse grid will still have relatively finer grid on 
the next coarse levels. Figure 4.1 shows the comparison of non- 
adaptive and adaptive coarsening structure. Darker color means 
coarser level. The circuit activity focuses on the up-right comer of 
the circuits. Active region has finer grid than die odier region in 
every grid level. 




(left) 



Adaptive (right) 



Figure 4.1 Non-Adaptive 
Coarsening Structure 

4.2 Adaptive Smoothing Operation 

To avoid the unnecessary computation for inactive regions, we 
reduce the global fine grid to several local fine grids 
corresponding to active regions at finest level and iterate the 
smoothing operator only inside these local fine grids. Although, in 
this paper, only finest grid is not global, we could have several 
levels of non-global grid. The active regions are detected by the 
first order derivative of nodal voltage (4.2). 

With local fine grid, we actually use different time steps for active 
and inactive subnetworks in the sense that inactive subnetworks 
may only get chance to get error smoothed at finest level once 
every several time points. 

5. EXPERIMENTAL flESULTS 

The proposed approach is implemented in ANSI C. The 
experiments arc executed on a SUN BladelOO (300MHz) 
workstation with 2GB memory. 

We set the number of pre-smoothing and post-smoothing iteration 
to 3 at each level and the multigrid iteration termination control 
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residue nom to uio'"- Gauss-Scidcl method acts as smoothing 
operator at each level. 

Table 1 shows the DC analysis niniime of SP1CE3 and our 
approach, whidi is more than 100 times faster than SP1CE3 for 
large circuits. Table 2 compares the transient analysis runtime of 
SP1CE3, non-adaptive AMG, and adaptive AMG. The time- 
varying currents are modeled as triangular waveforms with 2mA 
pesdL current and 40ps rising and falling time [21]. Current 
sources are not evenly distributed and the timings are also 
difTeFcnt We execute the transient analysis for 5ns. Experimental 
results show that adaptive AMG is about 10% faster than non- 
Adaptive AMG. Our approach mns more than 30 times fasXtr than 
SPICE3 for transient analysis. The performance speedup is 
comparable with the PCG approach in [10]. The number of 
multigrid iterdliuns dose nut increase rapidly wiQi the problem 
size. Actually the number of iterations is independent of the 
problem size, which cannot be proved but is observed in many 
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9.8 


2637 


3.91 


0.28 


14 


5105 


15.01 


0.57 


26J 


10322 


54.44 


0.98 


55.5 


40S42 


708.22 


3.93 


180.2 


91562 


X 


8.98 





Table 2. Transient Analysis Rontime Com 


parisoD (sec) 


Nodes 


SPICE 


AMG 


Adaptive AMG 


Adaptive Speedup 


1706 


18.06 


1.48 


1.38 


13.1 


2637 


4103 


4.22 


3.92 


10.5 


5105 


122.1 


9.05 


7.86 


15.5 


10322 


456.42 


19.34 


16.97 


26.9 


40842 


3048.5 


96.5 


87.55 


57.7 



Figure 5.1 compares the transient analysis voltage waveforms of 
one node from SPICE and our adaptive AMG approach. The 
waveforms are almost the same. 




Fig 5.1 Comparison of voltage waveforms from SPICE Qeft) 
and Adaptive AMG (right) 

6. CONCLUSION 

In this paper, wc present an adaptive AMG approach for power 
network analysis. The spatial variation of power supply noise is 
utilized to further speed up the performance. The new approach 
can handle power network of general topology with self and 
mutual inductances. Experimental result is very promising. 
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ATTACHMENT li 



Exemotarv Functions and Applications: 



The Spiceoiego is a fast accurate transistor level simulation tool thai can perfonn full-chip simulation including 
power network and dock network analysis with SPICE-level accuracy and guaranteed convergence. It can 
perform analysis of interconnect delay, crosstalk, voltage drop, ground bounce, simultaneous switching noise 
and EMI 



Exemplary Features and Advantages: 



SPICEoieoo adopts algebraic multigrid method as the solver engine, which avoids the high complexity of Gauss- 
Elimination melhod used in SPICE and slow convergence of basic iterative methods such as SOR and Gauss- 
Seidel. 

SPICEoiego fully explores the circuit spatial and temporal latency as well as the multi-rate behavior to avoid 
unnecessary computation without loss of accuracy and convergence via the novel adaptive extensfon of 
multigrid melhod. r u 

In addition. SPICEoiego has the ability to analyze coupling effect with electromagnetic retardations for high 
frequency chip-package-board scale analysis, while no commercial tools are available to consider the 
electromagnetic retardations. 

Nassda's HSIM and Synopsys's PowerMill / TimeMill / RailMill are relied on circuit partitioning or macro 
modeling method which has potential convergence problem and cannot capture coupling effect such as mutual 
inductance accurately. Apache's Redhawk-SDL is actually cell-based. Simplex (now part of Cadence)'s 
VOLTAGESTORM ignores the inductance effect. 



Section A: SPICEoiego Matrix Solver Engine 

Multigrid Method 

SPICEoiego adopts algebraic multigrid method as the solver engine. The multigrid method is an efficient 
technique first widely used for solving partial differential equations. The basic idea of multigrid method is to 
map the hard-to-damp low frequency error at fine level to easy-to-damp high frequency en-or at coarse level, 
solve the mapped problem at coarse level, and then map the en-or con-ection of coarse level back to fine 
level. A hierarchical grid structure is constnjcted for that purpose. At each level, a fonward iterative 
smoothing operator such as Gauss-Seidel erases high frequency error. There are two kinds of multigrid 
methods: geometric multigrid and algebraic multigrid (AMG). Geometric multigrid method requires regular 
mesh structure. For problems with irregular stmcture, AMG is a good altemative of geometric multigrid 
method. The coarsening and interpolation operation of AMG are based on matrix itself. This overhead 
makes AMG less efficient than geometric multigrid if tiie problem analyzed has regular mesh structure. 
Since digital or mixed-signal circuits usually have irregular stmctures, SPICEoiego adopts the algebraic 
multigrid method. Another reason is that mutual inductance can be easily handled with AMG. Accuracy is 
guaranteed by comparing the norm of residue with user-defined error tolerance. The multigrW iteration stops 
only after the norm of residue is smaller than ttie error tolerance. Promising speedup over SPICE Is 
observed in the experiments. 
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A description of single Mulligrid cyde is shown below. The mapping operator from fine to coarse level is 
called restriction operator if^, coarse to fine level mapping operator is called interpolation operator /j^. 
System equation a'u' = /* is* solved hierarchically at different levels; the superscript refers to the level of the 
grid structure. 
Single Multlgrid-Cycte Scheme 

• Relax on = /* i^, with initial guess , calculate residue r* = - A^v* 

• Compute /" = /j[*r* 

o Relax on a^u^ = /" i', with initial guess k"=0, r" = - A^v^ 
o Compute = /JJr" 

• Relax on A**u** = f*" ''"^ ^"^^ 1"'^^^ S"^* v**=0. r** = /** - A*^v^ 

• Conqwtc « /JJr** 

• Co coarsest level 

• Correct V** 4- v^* + /^v" 

• Relax on = /** limes with initial guess k** 
o Coirect v"^v"+/iV* 

o Relax on = /" y^iimes with initial guess . 

• Correct V* 4- v* + /Jjv" 

Relax on A^u* = /* with initial guess v* 

System Matrix Properties ;j . 

For circuits with passive elements, when only RC mode! is included, system equation can be formulated as 

cx{i)+ax{t)=U{t), (2.1) 

where X is the vector of nodal voltages. 

Appling trapezoidal approximation with time step h to equation (2.1). we have 

(C + ^C)X{t + A) = -<<7 - ^QX{i) + u{t) + u{t + h) (2-2^ 
The left hand side matrix in (2.2) Is symmetric and positive definite which makes the iterative methods 
converge quickly. 

When inductance is included, the system matrix equation becomes: 

cf(/)+6i(i)=(>(0 (2-^^ 

"^'=[-h-[']"^-K]'"'-[''."o1 
Equation (2.3) can be rewritten as: 

Appling trapezoidal approximation with time step h, the solution of equation (2.4) is derived by 

Although MNA method can handle elements without admittance description, the transient analysis system 
matrix in (2.5) is no longer symmetric and positive definite when the Inductance is included because of the 
introduction of cun-ent variables. ^^.^ 
Since Multigrid as well as PCG method requires the matrix to be symmetric positive definite, some extra 
processing is needed to reformulate the system. We split the variable vector into nodal voltage vector and 
branch current vector. Using block matrix operations, we decompose (2.5) Into two iteraUon formulas (2.6) 
for nodal voltages and branch currents, respectively. 
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(^.c-^x/l-i>i,)K(,4*)= (2.6) 

=> 'C* J Y^"'^pK(/)+IC/(i) + (/(< + A)J*2^//(T) 

In equation (2.6), oon^ponds to the K matrix so that the matrix inversion overhead is reduced by 
sparsification methods. 

If the inductance matrix is symmetric and positive definite (S.P.O.). we can prove that the system matrix is 
still S.P.D. This also holds for forvvard Euler, backward Euler integration approximation methods. Note that 
the topology of reformulated system matrix in (2.6) is no longer the same as original circuit topology. 
Geometric based coarse grid reduction algorithm cannot be applied to RLC networi^ directly. 
For circuits with nonlinear transistor devices, though the positive definite properties cannot be guaranteed, 
the system matrix usually exhibits this property especially for circuits with no strong feedback or high-gain 
elements. Thus, the fast AMG solver can be still be used. 

Section B: Exploitation of Latency & Multi-rate Behavior 

Most circuits display spatial and temporal latency as well as multi-rate behavior. Spatial latency means only 
part of circuits has activities at any time point. Temporal latency refers to the situation that a given portion of 
circuit is active in some periods but inactive in other periods. Even active portions have various change rate 
of cun-ent and voltage level. This phenomenon can be utilized to avoid unnecessary computattons. Different 
regions can be simulated with various time step size according to their activities, while SPICE have to apply 
the minimum time step size to the entire circuits for the sake of convergence. Many partition-based 
commercial fast simulators use varies time step size for each subcircuit and solve each sul>circuit 
separately. Though saving a lot of computation overhead, this kind of method is difficult to guarantee the 
convergence and cannot capture mutual inductance coupling effect correctly. 

Instead of trying to directly employ varies time step sizes, SPICEpiego uses adaptive grid stmctures and 
adaptive smoothing operations to capture the multi-rate behavior and circuit latency. Active regions have 
relatively finer grid structure and more en^or smoothing operations than Inactive regions. Convergence is 
guaranteed by the multigrid method, because multigrid converges as long as smoothing operations at each 
level can damp the high frequency enror. 

With the adaptive grid structure and adaptive smoothing, we actually use different 'time step size" for active 
and inactive subcircuits in the sense that inactive subcircuits may only get chance to have error smoothed at 
finest level once every several time points as shown in the figure 3.1 . 




Figure 3.1 Adaptive Grid Structures 

In SPICEoiego. i^ node voltage change in the linear network (power or clock network) between time points is 
less than a threshold M, this node is considered as an idle node. If the change of Vgs, Vds and Vbs for a 
transistor is smaller than thresholds, the transistor is considered inactive. For active portions, different level 
of threshold could be defined to detemnine various degrees of activities that results In multi local fine levels 
of grid structures. 

Section C: Device Model Evaluation 

Nonlinear device model evaluation takes more than 60% of transient simulation mntime and thus a 
bottleneck of the circuit analysis. Many contemporary commercial simulation tools haye an option to use 
simplified device models instead of detailed models such as BSIM3/BSIM4. 

In the SPICEoiego, a table MOSFET model is supported besides detailed BSIM3/BSIM4 models. The model 
table is generated from many detailed model evaluation runs before simulation. It is indexed by Vgs. Vds 
and Vbs and contains Ids, Gds (output conductance Gds = ^ ) and Gm (Transconductance Gm = -^) 

as table items. 
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Experiments show that uniform voltage grid resolution of 60. 60 and 200 for Vgs, Vds and Vbs respectively 
and linear interpolation produce very accurate results compared with detailed models as shown in Figure 
4.1. 
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Figure 4.1 Current drawn from VDD of an inverter using BSIM3 Model (left) BSIM3 Table Model 

(right) 





Figure 4.2 (a) Enror Percentage of Ids vs Vgs (left) Ids vs Vgs (right) 
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Figure 4.2 (b) Error Percentage of Gm vs Vgs (left) Gm vs Vgs (right) 
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Figure 4.2 (c) Error Percentage of Gds vs Vgs (left) Gds vs Vgs (right) 

Figure 4.2 shows the plots for the error percentage and absolute value of Ids, Gm and Gds vs Vgs of NMOS 
transistor in an inverter with a step function voltage at the gate terminal. The devices model is B5IM3v3 with 
TSMC .25 technology parameters. As observed in figure 4.2. table model is accurate when ldsA/ds/6m is 
non-zero. I-arge errors at the left end of Vgs axis are mainly due to the close-to-zero absolute values of 
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Ids/Gm/Gds and hence can be ignored. Moreover, due to the non-uniform distributed first derivative value of 
Ids, Gm and Gds as shown in figure 4.3, non-uniform voltage grid resolution can improve the accuracy 
without requiring more memory. 




(a) (b) (c) 

Figure 4.3 Contours of the (a) Ids (b) Gm (c) Gds with Vgs as X-axis and Vds as Y-axis 



MOSFET capacitance has several options from detailed BSIM3/BSIM4 model to plecewise constant linear 
capacitances. 3-4 Newlon-Raphson iterations are avoided for each device. 

Most of the product features have been Implemented in C on Unix platfomi. Some experiments have been 
conducted for power network analysis. Promising speedup has been obsen^ed in these experiments that are 
executed on a SUN Bladel 00 (300MiHz) workstation with 2GB memory. 

Table 1 shows the DC analysis runtime of SPICE3 and SPICEoiego. which is more than 100 times fester than 
SPICES for large circuits. Table 2 compares the transient analysis runtime of SPICES and SPICEcHegi,. We 
execute the transient analysis for 5ns. Experimental results show that SPICEoiego runs more than 30 times 
faster than SPICES for transient analysis. The speedup increases to 60 times when the size of the circuit 
grows to 40 thousand nodes. The number of multigrid iterations dose not increase rapidly with the problem 
size. Actually the number of iterations is independent of the problem size, which cannot be proved but is 
observed in many cases. 



Tablel DC An alysis Runtime Comparison (sec) 



Nodes 


SPICE 


SPICEoteco 


Speedup 


1706 


1.68 


0.17 


9.8 


2637 


3.91 


0.28 


14 


5105 


15.01 


0.57 


26.3 


10322 


54.44 


0.98 


55.5 


40842 


708.22 


3.93 


180.2 


91562 


X 


8.98 





Tabl e 2. Transient Analysis Runtime Comparison ( sec) 



Nodes 


SPICE 


SPICEoteoo 


Speedup 


1706 


18.06 


1.38 


13.1 


2637 


41.23 


3.92 


10.5 


5105 


122.1 


7.86 


15.5 


10322 


456.42 


16.97 


26.9 


40842 


5048.5 


87.55 


57.7 



Figure 5.1 compares the transient analysis voltage wavefomris of one node from SPICE and SPICEi«3ci. The 
waveforms are almost the same. 
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Figure 5.1 Comparison of voltage waveforms from SPICE (left) and SPICEotego (right) 
Exemplary Commercial Applications 



The SpiceoteoD is a ^ast accurate transistor level simulaUon tool that can perform full-chip simulation including 
power netvrork and clock network analysis with SPICEMevel accuracy and guaranteed convergence^ 
Applications include but are not limited to analysis of interconnect delay, crosstalk, voltage drop, ground 
bounce, simultaneous switching noise and EMI, which Is vital to chip designers. 
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